Clustering Multi-represented Objects Using Combination Trees

نویسندگان

  • Elke Achtert
  • Hans-Peter Kriegel
  • Alexey Pryakhin
  • Matthias Schubert
چکیده

When clustering complex objects, there often exist various feature transformations and thus multiple object representations. To cluster multi-represented objects, dedicated data mining algorithms have been shown to achieve improved results. In this paper, we will introduce combination trees for describing arbitrary semantic relationships which can be used to extend the hierarchical clustering algorithm OPTICS to handle multi-represented data objects. To back up the usability of our proposed method, we present encouraging results on real world data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Density-Based Clustering for Multi-Represented Objects

In recent years, the complexity of data objects in data mining applications has increased as well as their plain numbers. As a result, there exist various feature transformations and thus multiple object representations. For example, an image can be described by a text annotation, a color histogram and some texture features. To cluster thesemulti-represented objects, dedicated datamining algori...

متن کامل

Additive Simila Rity Trees

Similarity data can be represented by additive trees. In this model, objects are represented by the external nodes of a tree, and the dissimilarity between objects is the length of the path joining them. The additive tree is less restrictive than the ultrametric tree, commonly known as the hierarchical clustering scheme. The two representations are characterized and compared. A computer program...

متن کامل

Protection or Privacy? Data Mining and Personal Data

A multiclass classification method based on output design p. 15 Regularized semi-supervised classification on manifold p. 20 Similarity-based sparse feature extraction using local manifold learning p. 30 Generalized conditional entropy and a metric splitting criterion for decision trees p. 35 RNBL-MN : a recursive naive Bayes learner for sequence classification p. 45 TRIPPER : rule learning usi...

متن کامل

Advanced data mining techniques for compound objects

Knowledge Discovery in Databases (KDD) is the non-trivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in large data collections. The most important step within the process of KDD is data mining which is concerned with the extraction of the valid patterns. KDD is necessary to analyze the steady growing amount of data caused by the enhanced perf...

متن کامل

Clustering of Lidar Data Using Particle Swarm Optimization Algorithm in Urban Area

One of the fundamental steps in the transformation of the LIDAR data into the meaningful objects in urban area involves their segmentation into consistent units through a clustering process. Nevertheless, due to the scene complexity and the variety of objects in urban area, e.g. buildings, roads, and trees, it is clear that a clustering using only a single cue will not suffice. Considering the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006